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ABSTRACT 

We measure the angular 2-point correlation functions of galaxies, uj{0), in a volume 
limited, photometrically selected galaxy sample from the fifth data release of the Sloan 
Digital Sky Survey. We split the sample both by luminosity and galaxy type and use 
a halo-model analysis to find halo-occupation distributions that can simultaneously 
model the clustering of all, early-, and late-type galaxies in a given sample. Our results 
for the full galaxy sample are generally consistent with previous results using the 
SDSS spectroscopic sample, taking the differences between the median redshifts of the 
photometric and spectroscopic samples into account. We find that our early- and late- 
type measurements cannot be fit by a model that allows early- and late-type galaxies 
to be well-mixed within halos. Instead, we introduce a new model that segregates 
early- and late-type galaxies into separate halos to the maximum allowed extent. We 
determine that, in all cases, it provides a good fit to our data and thus provides a new 
statistical description of the manner in which early- and late-type galaxies occupy 
halos. 

Key words: cosmology: observations — galaxies: halos 



1 INTRODUCTION 



The 'halo-m odel' (see, e.g.|KaufFmann et al.|1997| Peacock & 



Smith 2000 : ' Cooray fc Sheth|2002[ [Zheng et al.|2005| [Tinker 
et al. 2005) has been developed to allow one to precisely 
model the clustering of galaxies. One can fill dark matter 
halos with galaxies based on a statistical 'halo-occupation- 
distribution' (HOD), allowing one to model the clustering 
of galaxies within halos (and thus non-linear scales) while 
providing a self consistent determination of the bias at linear 
scales. Thus, as shown by, e.g., Zehavi et al. (20041, Blake et 



al. ( 2008 1, Tinker et al. ( 2008 1, one can use measurements of 



galaxy 2-point correlation functions to constrain the HOD of 
different sets of galaxies and gain information on the nature 
in which galaxies occupy dark matter halos. 

Such a halo-model analysis can be particularly useful in 
constraining the clustering of early- and late-type galaxies. 
It has long been known that early-type galaxies cluster more 
strongly than late-type galaxies (recent studies include, e.g., 



Willmer et al.[[1998[ [Norberg et al.[[2002[ [Madgwick et al.[ 


2003 


Zehavi et al. 2005 


Ross et al. 2006 


Croton et al. 2006 


1, 



and that there exists a corresponding morphology-density 
relationship ( |Dressler|1980| ) which essentially states that the 
fraction of early-type galaxies increases with the density of 
the local environment. Zehavi et al. (2005; hereafter Z05) 
have incorporated this relationship into their halo modeling 
(by allowing the fraction of late-type galaxies to decrease as 



a function of halo mass) and shown that this approach can 
indeed effectively model the clustering of early- and late- 
type galaxies. 

Recent studies have determined that the morphology- 
density relationship can be more accurately described as a 



color-density relationship. Ball et al. ( j2008) find no resid- 
ual relation between density and morphology when remov- 
ing color (but do find a strong residual in density and color 
when removing morphology) and Skibba et al. ( 2008 1 find a 
strong environmental dependance on color, even for fixed 
morphology. This implies that deep photometric surveys 
(which are likely to have little morphological information) 
should be sufficient for quantifying the clustering as a func- 
tion of galaxy type. 

In this work, we use galaxies that are photometrically 
selected from the Sloan Digital Sky Survey (SDSS) fifth data 



release (DR5) to constrain HODs. Blake et al. (20081 have 
previously used photometric data to constrain the HODs of 
luminous red galaxies (LRGs), and we follow a similar ap- 
proach to constrain the HODs of early- and late-type galax- 
ies. The wealth of quality photometric data allows us to pre- 
cisely constrain the HODs of early- and late-type galaxies 
at higher redshifts than previous SDSS studies (e.g. Z05). 

Our paper is outlined as follows: ^ describes how we 
use the halo model to obtain model angular 2-point corre- 
lation functions of galaxies; ^describes how we both select 
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galaxies from the SDSS DR5 photometric catalog to pro- 
duce a volume limited sample (to z — 0.3), which we fur- 
ther subdivide by type and luminosity, and also estimate the 
redshift distribution of these galaxy samples; ^ describes 
how we measure the 2-point correlation functions of galax- 
ies and how we estimate the error on these measurements; ^ 
presents the results of our 2-point correlation function mea- 
surements and the best-fit HOD for each galaxy sample; in 
^ we compare our results to previous studies and discuss 
the implications of our measurements; finally, we conclude 
in ^ Throughout this work, we assume a flat cosmology 
with r2„ = 0.28, h = 0.7, erg = 0.8, F = 0.15 (as used in 
Ross et al. 2007; hereafter R07). 



2 HALO MODELING 

We use the halo model to produce model angular 2-point 
correlation and cross-correlation functions. The most ba- 
sic component of the model is the number density of ha- 
los at redshift z with mass M, n{M, z). We determine both 
n{M, z) and the bias of these halos, B(M, z), by using an el- 



lipsoidal collapse model (e.g., Sheth et al. 2001 1 coupled with 
the methods described in Nishimichi et al. ( 2006 1. We model 



the probability distribution of the number of galaxies occu- 
pying a halo of a given mass, the halo- occupation- distribution 
(HOD), to determine the mean number of galaxies, N{M), 
occupying halos of mass M. Following [Zheng et al] ( [20051 ) 



and Blake et al. (20081, we assume separate mean occupa- 



tions for central galajcies, Nc{M) and for satellite galaxies, 
N,{M). Thus, 



iV(M) = N4M) x{l + NsiM)) 



(1) 



Note what this implies: for a given halo mass, the mean 
number of satellites is Nc{M)Ns{M), but these satellites are 
broken between halos which have a central galaxy and those 
that do not. The halos that do have a central galaxy have 
Ns{M) satellite galaxies, and those without a central galaxy 
have zero satellites. This reasoning is crucial to understand- 
ing the equations presented throughout this section. 

Coupling this model with a formalism describing how 
galaxies distribute themselves within halos allows us to 
model the power spectrum (which we Fourier transform and 
then convert to lli{6) via Limber's equation; |Limber|[1954[ ). 
We assume that the spatial distribution of satellite galaxies 



follow the Navarro, Frenk, & White ( 1997 1 dark matter halo 
density profile: 



p(M,r) = 



M 

{cr/r„i,,{A<f))(l + cr/r„ir(M))2 



(2) 



47r{r„i^{M)/c)^[ln{l + c)-c/(l + c)] 

where rvir is the virial radius and c is the concentration pa- 



rameter. Following Zehavi et al. (20041, we define 200 as the 



critical over-density for virialization and can thus express 
the virial radius, r„ir- as a function of mass as 



3M 



200 X 47rp 



and inversely the virial mass as a function of scale as 
M^ir{r) = 200 X ^Trr^p 



(3) 



(4) 



where p is the mean co-moving background density of the 
Universe. Using this definition for the virial mass, [Zehavi et[ 
al. (20041 determined via Bullock et al. (2001 1 that c can be 



expressed 

c{M,z) = 11/(1 + z)(A//Me)"''-" 

where we determine logio(Afc) = 
units {h~^MQ), for our assumed cosmology (see, e.g. Blake 



(5) 

12.49, where Mc is in 



et al. 20081. We use the Fourier transform of the full (un- 
truncated) p{M,r), u{k\A4), to calculate the power spec- 
trum, and we use the form presented in [Scoccimarro et al. 



( 2001 1. We note that the difference in using the 



Scoccimarro 



et al. ( 2001 1 parameterization, and one in which p{M, r) is 
truncated (using, e.g.. Equation 8 from [Jain et al.|2003[ with 
r„ir as upper bounds on the integrals) is negligible (< 0.1%) 
in our model angular correlation functions. 

2.1 Modeling the Power-Spectrum 

The equations presented above allow us to model the power 
spectrum as having a contribution due to galaxies in two 
separate halos (the 2-halo term) and a contribution due to 
galaxies that reside in a single halo (the 1-halo term). 



P{k,r) = Pih(k) + P2h{k, 



(6) 



where P\h{k) is split into two components — one being the 
power-spectrum due to central-satellite pairs, Pcs{k), and 
the other due to satellite-satellite pairs, Pss{k): 



Pcs{k) 



Pss{k) 



dMn{M)Nc{M) 



2Ns{M)u{k\M) 



At) 



dMn{M)N^{M) 



{Ns(M)u{k\M)Y 



(7) 



(8) 



and 

P2h(k,r) =Prr. 



jMh™W dMn{M)b{M,r)^u{k\M) 



(9) 



where Pmatter is the matter power-spectrum determined via 
the fitting formulae of Smith et al.| ([2003). The parame- 
ter Miimir) is the mass limit due to halo-exclusion, which 
we determine using the methods described by [Tinker et al.| 
(20051 and Blake et al. (20081. The average number density 



of galaxies is given by rig, and n'g is the restricted number 
density of galaxies. The two number densities can be ex- 
pressed as 



AMn{M)N{M) 



and 



AMn{M)N{M) 



(10) 



(11) 



The scale dependent bias, b(M, r), can be expressed (Tinker 
et al.|2005 l as a function of the halo bias as 

[l + 1.17U(r)]i-*« 



b\M,r) = B\M) 



[l + 0.69e™(r)]2-09 



(12) 



where €,m{r) is the non-linear real-space matter 2-point cor- 
relation function, determined by Fourier transforming the 
matter power spectrum. 
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In many cases, it will be useful to calculate the effective 
mass, Meff, given by: 



M, 



1 



dMMn{M)N{M) 



(13) 



and also the overall bias of the galaxies given by the model, 
bgai given by: 



bgal - 



1 



AMB{M)n{M)N{M) 



(14) 



These parameters are quite useful when comparing our re- 
sults to each other and also to previous studies. 



2.2 Halo Occupation Distribution Model 

We model the HOD as a power-law with a softened tran- 
sition for both the central and satellite galaxies (note that 
this implies the softening effect is squared for the satellite 
galaxies). This is expressed as 



iVc(M) = 0.5 



Ns{M) = 0.5 



1-1- erf 



1 + erf 




(15) 



(16) 



These Equations are entered into Equation 1 to determine 
the mean occupation of halos at a given mass. 

The HOD model has four free parameters, but one can 
be removed by requiring that Ug match the observed number 
density of galaxies. To ensure this, we determine Mcut for 
any chosen combination of Ocnt, Mo, and a. We also model 
the number of early- and late-type galaxies by expressing 
the fraction of late-type centrals and satellites as a function 
of halo mass (similar to Z05). 



/,(M) = /eo exp 
and 

fs{M) = /,o exp 



-logio(M/Mc„t 



-\ogw{M/Mo) 

O'sat 



(17) 



(18) 



where we cap fs{M) and fc{M) (which we will from here on 
express as fs and /c in order to be concise) such that they 
are never greater than 1. 

This model once again has four free parameters, but we 
can remove one since we know the overall fraction of late- 
type galaxies. We thus calculate the required fco for every 
allowed combination of fso, o-cen, and aaat- 

In a previous work (Z05), color selected "red" and 
"blue" galaxies were assumed to be well mixed within ha- 
los. We find that this type of model does not provide an 
adequate fit to our measurements of the auto-correlations of 
early- and late-type galaxies. We instead assume that if a 
central galaxy is a certain type, its satellite galaxies will be 
the same type, up to the extent allowed by the N{M) and 
f{M) statistics of the given model. In order to be concise, 
we express the P{k) in terms of a new function 0(fc, M): 



dMe{k, M)n{M)Nc{M)Ns(M)u{k\M)/n 



where type can be either early or late. Equations [7] and [8] 
now become dependent on type and the relative values of 
fc and fs (which are themselves dependent on mass). If the 
late-type satellite fraction is greater than the late-type cen- 
tral fraction, all satellite galaxies around late-type central 
galaxies will be late-types (and thus each late-type central 
will have A''^, late-type satellites). If the opposite is true, 
each early-type central galaxy only has early-type satellites. 
For the average late-type central, the fraction of its satellites 
that are also late-type is the total fraction of late-type satel- 
lite galaxies (fs) divided by the fraction of late-type central 
galaxies (/c). Thus, the central-satellite term for late-types 
can be expressed as: 



ecs,lat.{k,M) =2/, {.fs>fc) 

= 2/, X fjf, = 2/, {fs < /c) 



(20) 



For the central-satellite term for early- type galaxies, we 
must take into account the fact that there will be late-type 
satellite galaxies around early-type centrals if fs > fc- In 
this case, we need to determine the fraction of early-type 
satellites around early-type centrals. The total fraction of 
early- type satellites is just 1 — fs. Thus the average fraction 
of satellites around early-type centrals that are early-type is 
(1 — /s)/(l — /c) and the total contribution due to early- type 
satellites around early-type centrals is (1 — /c)(l — fs)/{l — 
fc). For fc > fs, all satellites around early- type galaxies are 
early-type and each early-type central thus has Ns early- 
type satellites. The central-satellite term for early-types can 
therefore be expressed as: 



e 



cs, early 



ik,M) 



2(1 
2(1 



fc) 



ifs 



>/c 



(21) 



For the satellite-satellite terms, the same logic applies. 
If fs > fc there will be a term for both late-type satellites 
around late-type centrals and late-type satellites around 
early-type centrals. Around late-type centrals, all satellites 
are late-type, and the fraction of late-type satellites is just 
1. The total fraction of late- type satellites around early- type 
satellites is fs — fc and the fraction around only halos with 
early-type central galaxies is thus {fs — fc)/{^ — fc). This 
term must be squared to account for the total number of 
late-type satellite-satellite pairs around early-type galaxies. 
The total contribution due to halos with early-type centrals 
is thus (1 - fc) X \{fs - fc)/{l - fc)f. If instead fc > fs, 
there are only late-type satellite galaxies around late-type 
centrals, thus the contribution is fc x {fs/ fc)^ . We therefore 
express Oss,iate as: 



<dss,late{k,M) = 

[fc + {fs - /c)V(l - fc)] Ns{M)u{k\M) {fs > fc) 

{f'i/fc)Ns{M)u{k\M) {fs < fc) 



(22) 



g.typej 



(19 



For the early-type satellite-satellite term, if fs > fc, 
there are only early-type galaxies around early-type centrals. 
The fraction of satellites around early-type centrals that are 
early- type is {1 — fs) / {1 — fc) and thus the contribution to 
the satellite-satellite term is (1 — /c) x [(1 — /s)/(l — /c)]^. For 
fc > fs, the fraction of satellites around early- type centrals 
that are early-type is just 1 and the fraction around late- 
-ype centrals is {fc — fs)/fc = (1 — fs/ fc). We thus express 

ss, early aS. 
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{I- f sf/{l~ fc)\Ns{M)u{k\M) {fs>fc) (23) 

1 - /c + /c (1 - fs/fcf] Ns{M)u{k\M) if, < U) 

The requirements of Equations 20 through 23 segregate 
the early- and late-type galaxies as much as possible while 
maintaining the statistics of Equations of [17] and |18| This 
essentially results in a model where smaller mass halos will 
only be occupied by early- or late-type galaxies and larger 
mass halos will have a central early-type galaxy, many early- 
type satellite galaxies and room for a smaller (but signifi- 
cant) number of late-type galaxies. 

In the case of a cross-correlation, one must substitute 
^rieariyniate for Ug type- For both the central-satellite and 
satellite-satellite terms, only halos with early-type centrals 
will contribute if fs > ,fc and only halos with late-type cen- 
trals will contribute if fc > fs- Thus, 



the sample (using the Schlegel, Finkbeiner & Davis 1998 



ec.,ci(fc,M) 



2(/. 
2(/c 



fc) 
fs) 



(fs > fc) 
(fs < fc) 



(24) 



essAk,M) = 

2 [(1 - /.)(/. - /c)/(l - fc)] Ns(M)u{k\M) ifs > /c) (25) 
2/s(l - fs/fc)Ns{M)u{k\M) ifs < fc). 

These sets of equations account for all of the pairs of galax- 
ies that were present when the auto-correlation of the full 
sample weis measured. Thus, the expressions that repre- 
sent the fractions of pairs contributing to each term must 

add to one (i.e., 9cs,;ate/2 + Qcs.sarlyf^ + &cs,el/'2 = 1 
and ess,latc/{NsiM)u{k\M)) + ess.early/{NsiM)u{k\M)) + 

&as,ei/{Ns{M)u{k\M)) = 1). Inspection of our Q expres- 
sions reveals that this is indeed the case. 



2.3 Transformation to Angular Correlation 
Function 

In order to compare our measurements to our HOD model, 
we must Fourier transform the model power spectra to a 
real-space correlations function: 



C(0 



1 

2^ 



dk P{k,r)k'^ 



kr 



(26) 



and use Limber's equation to project the real-space model 
to angular space (assuming a flat Universe): 

Lo{e) = 

2/c/J" dz H{z)(dn/dz)^ du ^{r = y/u^ + x^z)9^) 

where c is the speed of light, H{z) is the expansion rate of 
the Universe, dn/dz is the normalized redshift distribution, 
and x(z) is the comoving distance to redshift z. 



3 DATA 

The data analyzed herein were taken from the SDSS DR5 



Abazajian et al. (20051. This survey obtains wide-field CCD 
photometry Gunn et al. ( 1998 1 in five passbands (u, g, r, i, z; 



e.g., [Fukugita et al.||1996[ ). The entire DR5 represents close 
to 8,000 square degrees of observing area. We selected galax- 
ies lying in the Northern, contiguous portion of the SDSS 
from the DR5 PhotoPrimary database and matched them 
to galaxies from the DR5 PhotoZ table. We constrained 



dust maps) to have reddening-corrected magnitudes in the 
range 18 ^ r < 21. We further masked our data by using 
the same pixelized mask of R07 (which cut on the SDSS 
DR5 survey area, seeing > l."5, r-band reddening > 0.2, 
bad pixels, satellite trails, etc.). This left 5,407 deg'^ of ob- 
served sky. Following the methods outlined in |Budavari et| 



(27) 



al. (20031, we created a volume limited sample with z < 0.3 
and Mr < —19.5 (we note that this same volume limited 
sample is used in R07). After masking, this sample, here- 
after denoted as Z3, contains nearly four million objects 
(3,980,652). 

We also subdivide these data samples by luminosity and 
type. We select galaxies with Air < —20.5 from Z3 to pro- 
duce a sample with just over 1.3 million galaxies (1,302750), 
which we denote Z3B. Each sample (Z3 and Z3B) is also 
split by galaxy type based on their type values from the 
DR5 PhotoZ table. As in R07, galaxies with t > 0.3 are put 
in our late-type sample and those with t ^ 0.3 are put in 
our early-type sample. The Z3 has nearly as many late-type 
galaxies (1,984,021) as early-type galaxies (1,996,631), while 
Z3B has significantly more early-type galaxies (820,789 to 
481,961). In total, this gives six galaxy samples for which 
we measure u}{6) and determine a best-fit halo occupation 
model. 

3.1 Redshift Distributions 

We require a knowledge of the redshift distribution for each 
of our galaxy samples in order to compare our observations 
to theoretical models. To build the redshift distribution for 
each sample, we treat each observed redshift as a Gaussian 
probability-density-function (PDF) with a equal to the es- 
timated error. The PDFs for each redshift were sampled in 
order to find the expected number of objects within bins of 
width 0.001 in z. We normalize these distributions to have 
unit area and use this result in Equation [27] The normalized 
dn/dz of our samples are plotted in Figure [l] The normal- 
ized distributions of all-, early-, and late-type galaxies are 
quite similar for the Z3 and Z3B samples, implying that a 
direct comparison between the two samples is justified. 

The un- normalized dn/dz is used to determine the num- 
ber of galaxies within a volume element defined by z + dz. In 
order to determine rig, we integrate over the entire redshift 
range with weights given by (dn/dz) ^: 



dz 



H{z) 



dn 



4,TTfobsX^{z)c dz 



(£) ' h'i^) 



where fobs is the observed fraction of the sky and is 0.131 
for our masked DR5 sample. By calculating Ug in this man- 
ner, we account for the non-negligible photometric redshift 
errors, which make the total volume occupied by the galax- 
ies in each of our samples larger than that of a truly volume 
limited sample. For each of our two main samples, we use 
this formalism to measure rig and compare this to the model 
Tig in Equation |lO] to determine the value of Mcut given Mo, 
a, and acut- 

When modeling the HOD of early- and late-type galax- 
ies, we are constrained by the fact that the HOD model 
fraction of late- types, fiate, must match the observed frac- 
tion. The HOD model fraction of late-types is determined 
via 
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Figure 1. The normalized redshift distributions for each of the 
six galaxy samples we use (z <0.3, Mr < —19.5, bottom; z < 0.3, 
Mr < -20.5, top) with the distribution for each full sample plotted 
in black, the late-type distributions plotted in blue, and the early- 
type distributions plotted in red. 



flate — 



dMn{M) [UM)N4M) + fs{M)N,{M)Ns{M) 



In this way, we determine fco for each given /so, (Teen, and 
(Jaat such that the model fiate matches the observed fiate- 



4 MEASUREMENT TECHNIQUES 

We calculate the angular 2-point correlation function, u;{6), 
of galaxies using the Landy & Szalay ( 1993 1 estimator: 



oj{e) 



DD{e) - 2DR{9) + RR(e) 
RR{e) 



(30) 



where DD (in our case) is the number of galaxy pairs, 
DR the number of galaxy-random pairs, and RR the num- 
ber of random pairs, all separated by an angular distance 
9 ± A9. We also calculate angular 2-point cross-correlation 
functions, for which we also employ the |Landy fc Szalay] 
1 1993 1 estimator: 



iOel{9) = 



DiD2{e) - DiRje) - D2R{9) + RR{9) 
RR{9) 



(31) 



where Di and D2 represent the two data samples that are 
being cross-correlated (note that the single random file can 
be used in our case since all of our samples have identical 
angular selections and that we use the subscript el because 
we will exclusively be cross-correlating early- and late-type 
galaxies). In every case, we mask our data and randoms by 
using the same pixelized mask of R07. 



4.1 Errors and Covariance 

We compute errors and covariance matrices using a method 
that estimates the statistical error associated with our an- 
gular selection and another that estimates the statistical er- 
ror associated with our radial selection. We use a jackknife 
method (e.g., Scranton et al. 20021, with inverse-variance 
weighting for both errors (e.g., Myers et al.|2005 20061 and 
covariance (e.g., Myers et al.|2007 l to account for the errors 
based on our angular selection; the method is nearly identi- 
cal to the method described in detail in R07. The jackknife 
method works by creating many subsamples of the entire 
data set, each with a small part of the total area removed. 
We found in ROT that 20 jack-knife subsamplings are suf- 
ficient to create a stable covariance matrix. These 20 sub- 
samples are created by extracting a contiguous grouping of 
l/20th of the unmasked pixels in 20 separate areas. Our 
covariance matrix, Cjack, is thus given by 



Jack — 

= §J2l^l[^fMS^) ~ ^k{9^)][u;full{9J) 



^k{9,)] 



(32) 



where uJkifi) is the value for the correlation measurement 
omitting the fcth subsample of data and i and j refer to the 
i*^ and j*'' angular bin. The jackknife errors are simply the 
square-root of diagonal elements of the covariance matrix. 

We must use a separate method to account for uncer- 
tainties introduced by our radial selection. In essence, we 
are attempting to measure the auto-correlation functions of 
galaxies for a given redshift distribution (since the redshift 
distribution figures prominently in our models). Our defined 
cuts on photometric redshift do not uniquely produce the 
(29)redshift distributions displayed in Figure [l] In order to ac- 
count for this, we re-sample the photometric redshift catalog 
to create 'perturbed' samples whose redshift distributions 
match those of the original sample. We take the redshift 
of each galaxy to be randomly selected from its PDF and 
re-calculate Mr based on this redshift. If these perturbed 
redshifts and magnitudes satisfy our selection criteria, they 
are included in the new sample of galaxies. In order to ade- 
quately reproduce the redshift distributions of Figure [l] we 
find that we can only allow galaxies with z < 0.32 into our 
perturbed sample. For the early- and late-type galaxies, we 
also perturb the type value based on the type-error (assum- 
ing it is Gaussian) when producing our perturbed samples. 
The type errors scale linearly with the photometric redshifts 
errors. Thus we use 



(33) 



where t is the galaxy type, t„ and z„ are the perturbed 
type and redshift, and at and Gz are the type error and 
photometric redshift error of each object obtained from the 
DR5 PhotoZ table. 

For each of our galaxy samples, we create ten perturbed 
samples. The percentage of galaxies that match between 
samples varies between 77% and 85% for any given par- 
ent sample (variation within any group of ten perturbed 
samples is less than 1%, e.g., the percentage of matching 
galaxies is always between 76.5% and 77.4% for late-type 
galaxies from the Z?,B sample). We calculate ijj(9) for each 
of the perturbed samples and calculate C^.- 



= J2k = l .fm[i^ave{9i) - UJk{9i)][LOave{9 j) - i^k{9j)] 



(34) 
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where LOavn is the average auto-correlation of each of the ten 
perturbed samples and fm is the average fraction of galaxies 
that match between sample k and the other nine samples. 
We find that typically Cij,z ~ O.SCijjack, meaning that 
they are small but non-negligible. 

We thus combine Cz and Cjack to obtain the full covari- 
ance matrix for each sample (i.e. Ci,j — Ci,jjack + C'i,^^). To 
properly constrain fit parameters, we minimize the using 
our covariance matrixes via the equation 



for a feature with a local maximum right at Mq. This is 



consistent with the density- morphology relation (Dressier 



(35) 



where ujm{d) refers to the model angular 2-point correlation 
function. 



5 MEASUREMENTS 

We have measured the angular 2-point correlation functions 
and found the best-fit HOD for galaxies in two luminosity 
threshold samples; Z3 (2 < 0.3 and Mr < -19.5), and Z3B 
{z < 0.3 and Mr < —20.5). For each sample we also found 
the parameters that best-fit the uj{9) of early- and late- type 
galaxies. The values of the best-fit HOD parameters for each 
full sample can be found in Table [l] and the best-fit param- 
eters for the early- and late-type samples can be found in 
Table [2] 

The top-left panel of Figure |2] displays the measured 
ui{6) for galaxies in the Z3B sample for all (black triangles), 
early- (red triangles), and late- (blue triangle) type galaxies 
with the best-fit model ll>{0) plotted with correspondingly 
colored lines (solid for all, dashed for late-, and dotted for 
early- type galaxies). The fit to all galaxies is excellent, as 
fitting between 0.003° and 1° (10 degrees of freedom) yields 
~ 3.9. The fit to the early- and late- type galaxies is 
tolerable, as for the 23 degrees of freedom ~ 19.2. The 
model performs slightly better for the early-type galaxies 
(X^ = 8.4), than for the late-types (x^ = 10.8). The largest 
discrepancies are at small scales, where the late-type model 
is too low. We note that if we had assumed that early- and 
late- type galaxies mix freely within halos, the fits would have 
been significantly worse, as we find the minimum x^ is 55 
for such a model when fitting the measurements over the 
same angular range. 

The top-right panel of Figure [2] displays the best-fit 
HOD for all (black), early- (red), and late- (blue) type galax- 
ies from the Z3B sample. The HOD for all galaxies shows 
inflection points around Mcut = 1.30 x lO^^/i'^M©, which 
defines the mass scale at which halos host a central galaxy, 
and Mo = 3.08 x 10"/i"^Mq, which defines the mass scale 
at which halos will host satellite galaxies. The best-fit late- 
type HOD shows a local minimum at close to 10^^h~^ Mq. 
This shape is a consequence of the model — the fraction 
of central late-type galaxies decreases as the mass increases 
and thus the late-type HOD decreases until the halos are 
massive enough to host satellite galaxies. Even so, the slope 
of the late-type HOD is significantly smaller than for the 
overall HOD, allowing the fraction of late- type galaxies to 
be the largest in small mass halos and smallest in high 
mass halos. This can be seen clearly in the top panel of 
Figure [3] where the fraction of late- type galaxies is plotted 
against halo mass. The decrease is nearly monotonic except 



[ipSOl , as the fraction of late- type galaxies decreases as halo 
mass increases. The model allows fc to increase from 0.38 
at M = Mcut to 1 near 3 x 1O"/!."^M0. We note that we 
also tried a model where fc = /cO for M < Mcut, but we did 
not obtain acceptable fits. 

The measured and best-fit model u}{9) for galaxies from 
Z3 are plotted in the bottom-left panel of Figure [2] The 
best-fit model to the entire sample is acceptable, as x = 1-9 
fitting between 0.003° and 1° (10 degrees of freedom). The 
model fits for the early- and late-type galaxies also per- 
form well, as the total x^ = 15.7 (23 degrees of freedom). 
Once again, the fit is slightly better for early-type galax- 
ies (x^ = 6.1) than for late-type galaxies (x^ = 9.6). The 
largest disagreements are at small scales where the late-type 
model is not quite large enough. Once more, we note that 
our model out-performs one in which we allow early- and 
late-type galaxies to mix freely within halos; we find the 
minimum for this class of model is 68 when fitting over 
the same angular range. 

The galaxies in Z3B form a brighter subset of the 
Z3 galaxies. Thus, as can be seen in the righthand pan- 
els of Figure |2j the HOD of each Z3 sample is larger than 
its Z3B counterpart at every mass scale (though they are 
all nearly equal at 10^^ Mq). The infiection points of 
the Z3 HOD for the full sample (occuring around Aleut ~ 
11.73 X lO"/!.~^A/0 and Mq = 1.29 x 10"/i"^Mq) are not 
as pronounced as for the Z3B sample, which is due to the 
larger value of acut = 0.7 (as compared to the acut ~ 0.4 for 
the Z3B sample). The best-fit HOD of the late- type galax- 
ies shows similar behavior to the Z3B late-type HOD, as 
once more the late-type fraction is greatest at small halo 
masses and least at large halo masses. The overall number 
of late-type galaxies is more than four times higher, and this 
very nearly matches the difference in the late-type HODs for 
Mhaio > 10^^ Mq. At smaller halo masses, there are more 
significant differences. The Z3 late-type HOD does not have 
a local minimum, only a significant inflection point. This is 
primarily due to the fact that the parameter which governs 
the decrease of fc, Ucen, has increased from 0.63 to 0.82. 

The bottom panel of Figure |3] displays the fraction of 
late-type galaxies for the best-fit HOD model of the Z3 sam- 
ple, as a function of halo mass. Again, there is a nearly 
monotonic decrease with a local maximum at M\. As plot- 
ted with dashed lines, this local maximum is due to a peak 
in the late-type satellite fraction. This peak exists at A4o be- 
cause, at smaller halos masses, the total fraction of galaxies 
that are satellites drops sharply and the fraction of galaxies 
that are late-type satellites must as well. (Note that the data 
is displayed such that the satellite and central fractions add 
to the total fraction.) 

The late-type satellite fraction is larger for the Z3 sam- 
ple, but this due primarily to the fact that the overall late- 
type fraction has increased from 0.37 to 0.498. The overall 
fraction of late-type galaxies that are satellites is smaller for 
the Z3 sample (as presented in Table 2, it is 0.118 for Z3 
and 0.149 for Z3B). This is due to the fact that the bulk of 
the late-type galaxies are central galaxies in low mass ha- 
los, and the total fraction of centrals is higher because the 
number density of halos is larger at small mass. Conversely, 
the satellite fraction of early-type goes up for the Zi sam- 
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Figure 2. The left panels display the measured angular auto-correlation functions for galaxies with Mr < 
— 20.5 (top) and Mr < —19.5 (bottom), for all (black triangles) early- (red squares) and late-type (blue 
circles) with lines that correspond to the best-fit model (black solid for all, red dotted for early-, and blue 
dashed for late-type). The right panels display the best-fit HOD for all (black solid), early- (red dotted), 
and late-(blue dashed) type galaxies for Mr < —20.5 (top) and Air < —19.5 (bottom). 

Table 1. The best-fit values of the HOD parameters and the associated values for the two main samples studied. All masses arc in units J\/q/i~^. 



Sample 


a 


log (Mcut) 


log(Afo) 


O'cut 


xVdof 


log {M,ff) 


h 


ng (/i^Mpc-^) 


fsat 


Z3B 


1 14+0.02 
1 9f:s+0-026 


11.866 
12.115 


13.llit0.01 
13.488l«:°?? 


n 74-0.06 
' -0.09 

41 +"■1=' 
'^•^^ -0.14 


1.9/10 
3.9/10 


13.13 
13.21 


1.09 
1.17 


0.0102 
0.0041 


0.148 
0.130 



Table 2. The best-fit values of the HOD parameters and the associated x values for the early- and late- type samples studied. 



Sample 


fcO 


fsO 






xVdof 




^1, early 


flate 


fs at, I ate 


f sat , early 


Z3 
Z3B 


0.38 
0.437 


0.38it0.02 


0.84it0.09 

63+° °* 


n sn+0 09 
0.35 ±0.05 


15.7/23 
19.2/23 


0.89 
0.98 


1.16 
1.27 


0.498 
0.370 


0.118 
0.149 


0.180 
0.118 
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Halo Mass (h-iMg) 

Figure 3. The fraction of galaxies tiiat arc late- type as a function 
of halo-mass. The top panel displays the information for galax- 
ies with Mj. < —20.5 and the bottom panel for galaxies with 
Mr < —19.5. In both panels, the full fraction is displayed with a 
solid line, the central fraction is displayed with a dotted line, and 
the satellite fraction is displayed with a dashed line. Note that 
the information is displayed such that the satellite fraction and 
central fraction add to the full fraction. 



0.1 r^„ (h-' Mpc) 1 10 




6(degrees) 

Figure 4. The measured cross-correlations of early- and late-type 
galaxies in for the Z3 sample, volume limited with z < 0.3 and 
Mr < —19.5 (bottom), Z3B sample, volume limited with z < 
0.3 and Mr < —20.5 (top) are displayed in black triangles. The 
appropriate model cross-correlation, determined using the best- 
fit HOD of the early- and late-type autocorrelations, is displayed 
with a solid black line in panel. 



pie. Essentially, the model implies that a majority of the 
late-type galaxies with —19.5 < Mr < —20.5 are central 
galaxies occupying low-mass halos and the majority of the 
early- type galaxies with —19.5 < Mr < —20.5 are satellites 
in higher-mass halos. 

We also measure the 2-point cross-correlation function 
of early- and late-type galaxies for each sample. These mea- 
surements are plotted in Figure |4] (black triangles) along 
with the model uiei{9) that results from using the best-fit 
parameters determined from the autocorrelation measure- 
ments. The models appear close to the measurements, and 
the are 16.1 for Z3 and = 15.3 for Z3B. These values 
are impressive considering the size of the error bars and the 
fact that we did not specifically fit for these measurements. 
Once again, the best-fit models which include mixing are sig- 
nificantly worse (x^ = 92.4 for Z3 and x^ = 86.6 for Z3B). 
Thus, in every example, our model, which segregates early- 
and late-type galaxies to the maximal extent, performs sig- 
nificantly better than one in which the early- and late-type 
galaxies are allowed to mix freely. 

The model cross-correlation functions match the mea- 
surements quite well on large scales, which is what one 
sh ould expect, as the 2- halo cross-power spectrum is just 
P{k, z)iateP{k, z)eariy The agreement suggests that there 
are not any major systematic problems in our construc- 
tion of the redshift distributions of the early- and late-type 
galaxies. When we model the angular cross-correlation, effec- 
tively there is a term J {dn/dziate){dn/dzeariy)- If the true 
redshift distributions differ greatly from those we estimate, 
then our models would not be able to simultaneously fit 



the large scale autocorrelation and cross-correlation func- 
tion measurements. 



6 DISCUSSION 

We have measured the angular auto-correlation functions of 
galaxies photometrically selected from the SDSS DR5. We 
have used these measurements to constrain the HOD of the 
galaxies and determine its dependence on luminosity, and 
galaxy type. We have found that that the fact we are us- 
ing photometric redshifts requires a special prescription for 
determining number densities (see ^3.1[| an d introduces an 
extra source of statistical error (see S4.ll. Most interest- 



ingly, we have found that in order to simultaneously model 
the clustering of early- and late- type galaxies and their cross- 
correlation, we cannot allow them to mix freely within halos. 

Further insight can be gained by looking at the real- 
space 2-point correlation functions of our best-fit models. 
The ^(r) of the early-, all, and late-type galaxies are dis- 
played in the top, middle, and bottom panels of Figure [5] 
with solid and dotted lines representing the Z3 and Z3B 
samples. Due to the wide range in scales, we multiply each 
^ by r'^, which allows the differences between each correla- 
tion function to be seen clearly. The differences between the 
model ^(r) of the Z3 and Z3B samples are in line with what 
one would expect. The samples share the same median red- 
shift {z ~ 0.25), so direct comparison is valid. The model 
^aii{r) of Z3B is consistently higher than that of Z3, as one 
would expect given the differences in luminosity. Interest- 
ingly, the ^(r) of the early- and late-type samples increase 
by a smaller factor than the full sample. The f (r) amplitudes 
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Figure 5. The best-fit model real-space 2-point correlation func- 
tions multiplied by for early- (top), all (middle), and late- 
(bottom) type galaxies for the Z3 sample, volume limited with 
z < 0.3 and Mr < —19.5 (solid lines), Z3B sample, volume lim- 
ited with 2 < 0.3 and Mr < -20.5 (dotted lines). 



for the Z3B sample are bolstered not only by the increase 
in luminosity but also due to the decrease in the fraction of 
late-type galaxies (~ 0.5 compared to ~ 0.37). 



6.1 Comparison With Previous Results 

We can investigate further by comparing our work with that 
of Z05, who found the best-fit HOD for galaxies from the 
spectroscopic portion of the SDSS (z ~ 0.1). The model 
used by Z05 also did not include a a^ut parameter. |Tinker| 
|et al.| 12008] ) constrained the HOD of SDSS galaxies using a 
HOD model that does include acut, but the facts that Z05 
constrain galax;y samples with luminosity thresholds that 
match ours {Mr < —19.5 and Mr < —20.5) and that the 
results of Tinker et al. ( 2008| generally agree with Z05, make 
it a more appropriate reference. 

Despite the differences in the samples used and in the 
models, the differences between the Z05 best-fit model pa- 
rameters and ours can be explained by the fact that Z05 
used a fairly different cosmological model. In order to com- 
pare our results to those of Z05, we calculate real-space 2- 
point correlation functions using the Z05 cosmological model 
(i^totai = 1, = 0.3, h = 0.7, r = 0.21, erg = 0.9) 
and best-fit HOD parameters. In Figure |6] we display our 
best-fit real-space 2-point correlation functions for our Z3 
and ZSB samples (solid lines) compared to the Z05 best- 
fit real-space 2-point correlation functions for Mr < —19.5 
and Mr < —20.5 (dotted lines). Once again we multiply 
each ^ by r^, in order to clearly see the differences between 
the respective correlation functions. We multiply the Z05 
correlation amplitudes by a factor that allows for passive 
evolution between z = 0.25 and z = 0.1 given by (see, e.g., 
Wake et al.|2008 1 



Figure 6. The top panel displays the best-fit model real-space 
2-point correlation functions multiplied by for our data vol- 
ume limited with z < 0.3 and Mr < —19.5 (solid lines) com- 
pared to the best-fit model real-space 2-point correlation func- 
tions from Z05 with Mr < —19.5 (dashed lines) and the same 
model shifted to 2 = 0.25 assuming passive evolution (dotted 
lines). The bottom panel displays the same information for sam- 
ples with Mr < -20.5. 



Cw(r) 
Cio(r) 



ho 



(36) 



where the subscripts lo and hi refer to the appropriate fac- 
tors at the lower and higher redshifts and D is the linear 
growth factor (see, e.g.. Mo fc White ||1996 l. We note that 
applying this factor simply allows for a proper comparison 
between the two clustering signals — we are not arguing that 
galaxies with Mr < —19.5 should passively evolve between 
z = 0.25 and 0.1. The comparison is particularly apt for the 
A'lr < —19.5 samples, as the co-moving number densities 
are the same to 3 significant figures (0.0102 /i^Mpc"^), and 
slightly less so for the Mr < —20.5 sample as the number 
density in our sample is roughly 33% higher than the Z05 
sample. 

For both samples, the correlation functions have similar 
amplitudes at large scales, but the Z05 amplitudes are sig- 
nificantly larger at small scales. This is what we would gen- 
erally expect, as merging halos between 0.25 and 0.1 should 
increase the overall satellite fraction and thus increase the 
amplitude of the one halo term in the correlation function. 
This is indeed the case in the best-fit models. As presented 
in Zheng et al. (20071, the satellite fraction is ~0.2 for 
galaxies from the SDSS spectroscopic, while ours is ~0.15. 
The decrease is perhaps slightly more than one would expect 
between 2 ~ 0.1 and z ~ 0.25, but the general trend is as 
expected. We therefore do not find any significant disagree- 
ment between the clustering of galaxies in our photometric 
samples and that of galaxies from the SDSS spectroscopic 
sample. 

On the other hand, our results differ from those of Z05 
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with respect to the early- and late-type galaxies. The man- 
ner in which Z05 separated galaxies into different samples 
differs slightly from ours, as Z05 uses a Js'-corrected color 
cut and we use an estimated spectral type. One would ex- 
pect the galaxies Z05 classify as "blue" to predominantly 
be late-type (and the ones Z05 classify as "red" to predom- 
inantly be early-type). It is thus somewhat surprising that 
the Z05 measurements were well-fit by a model allowing red 
and blue galaxies to mix freely within halos — while we 
find, that in all cases, a model that separates the early- and 
late-type galaxies to the maximal extent possible (given the 
statistics) provides a much better fit to our data. The dis- 
crepancy is likely due to the fact that the smallest scales are 
most strongly affected by this treatment of the galaxies and 
that we probe significantly smaller scales than Z05 (our fits 
extend to req ~ 0.03 Mpc). 

We can also easily compare our results to R07, as we use 
the same volume limited samples and early-/late-type splits. 
Comparing the b\ values of the full samples, they are quite 
close, but slightly inconsistent given the la errors quoted 
in R07. This is due to the fact that in R07 the 61 values 
were measured directly from the data, while the 61 values 
quoted in this work are from the models derived from the 
best-fit HOD. Therefore, one would expect slight disagree- 
ment. The disagreement is never greater that 4%, so this 
is not worrisome. The disagreement is greater for the early- 
and late-type samples, which is likely due to the treatment 
of the photometric redshift distributions. R07 determined 
a model u>2, dm using the redshift distribution for all of the 
galaxies in each respective sample and used it to find the all, 
early-, and late-type 61 parameters for the sample. As can 
be seen in Figure [l] the redshift distributions of the early- 
and late-type galaxies show different shapes. In general, the 
early-type distributions are more narrow, and the late-type 
distributions wider than the full distribution. Thus, using 
the full sample to find 61 of late-type galaxies would cause 
the parameter to be under-estimated, and using it to find hi 
of early-type galaxies would cause the parameter to be over- 
estimated. This appears to be the case for R07, as the b\ of 
the late-type galaxies are consistently smaller than ours and 
the early-type 61 are consistently higher. 



6.2 Mixing 

Our best-fit models of early- and late-type galaxy clustering, 
constrained via our auto-correlation measurements, provide 
good fits to the respective auto-correlation measurements 
and also acceptable fits to the cross-correlation measure- 
ments (see Figure [4|. We have investigated changing other 
parameters in the models to determine if there is an alterna- 
tive course to separating early- and late-type galaxies into 
different halos, but none have provided acceptable fits. For 
example, we have tried a wide range of models for the den- 
sity profiles of the late-type galaxies within halos that are 
different from the standard NFW (though always spherically 
symmetric), but we were not able to significantly improve 
the model fits (either while allowing mixing or not). We 
have also tried using different forms for the concentration 
parameter of late-type galaxies, but again, this produced no 
meaningful improvement. Given the data at hand, we are, 
therefore, convinced that our model represents the optimal 



way of modeling the clustering of early- and late-type galax- 
ies. 

By looking at the measured uli{9), one can see why mod- 
els with mixing can not reproduce our measurements. For 
any model that allows even mixing, if a model for late-type 
galaxies is forced to be closer to the model for all galaxies, 
the early-type model will have to do the same (in the case 
where the number densities of early- and late-type galaxies 
are equal, the response should be entirely symmetric). Our 
measurements for both galaxy samples show the measured 
uij(9) of late-types getting closer to the measured uj{6) of all 
galaxies as the scale gets smaller, while the early-type mea- 
surements do not get any closer to the measurements for 
all galaxies. Segregating the galaxies allows more freedom 
in each galaxy-type's model lo{9) relative to the model for 
all of the galaxies. This can be illustrated by imagining two 
samples that are completely segregated but have identical 
clustering properties within halos. In combining these two 
samples, the number of close pairs will only double (along 
with the total number of objects) and the correlation func- 
tion for the entire sample will be half as large as for either 
of the original samples. In the case where the two samples 
are mixed within halos, combining the two samples would 
quadruple the number of close pairs, thus producing the 
same result for the correlation function. 

The shapes of the model ujei{0) elucidate the mini- 
mum degree to which Equations [Tt] and [18] require mixing 
of galaxy types. If, for example, the galaxy types were al- 
lowed to be completely segregated, the model cross-power 
spectrum one halo term would be zero and the model cross- 
correlation would be completely flat at small scales. Clearly, 
the best-fit models are not allowing such extreme segrega- 
tion. In most cases, Equations |17| and |18| will require that 
many late-type galaxies are satellites of central early-type 
galaxies, who also have many early-type satellites (this will 
happen for any halo mass where fs is greater than fc). 



7 CONCLUSIONS 

We have measured the angular 2-point correlation functions 
of galaxies drawn from volume limited samples of SDSS DR5 
galaxies with z < 0.3, Mr < -19.5 and z < 0.3, Mr < -20.5, 
each of which are further subdivided into early- and late- 
type galaxy subsamples. By modeling the angular 2-point 
correlation function, we have shown, for the first time, that 
the best halo model is one in which early- and late-type 
galaxies are segregated to the maximal extent possible. Pre- 
vious studies (such as Z05) modeled the clustering of red 
and blue galaxies (which should predominantly be early- 
and late-type galaxies, respectively) by allowing mixing be- 
tween the galaxy types within halos; these studies, however, 
did not probe to the same small scales we have, which is 
where the models that allow mixing disagree the strongest 
with our measurements. 

We plan to follow-up this work by using data from the 
SDSS DR7 to constrain the HOD as a function of redshift. 
The analysis techniques presented in this work provide a 
foundation upon which to base this extension and the im- 
proved photometric redshifts of the DR7 data should enable 
a reliable determination of the evolution of clustering as a 
function of galaxy type to z < 0.4. 
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